2025.10.21 | 模型不懂光影折射;小模型也能写报告
Description
本期的 13 篇论文如下:
[00:21 ] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing?(PICABench:我们离物理真实的图像编辑还有多远?)
[01:04 ] 🤖 DeepAnalyze: Agentic Large Language Models for Autonomous Data Science(DeepAnalyze:面向自主数据科学的智能体大模型)
[01:50 ] 🗜 Glyph: Scaling Context Windows via Visual-Text Compression(Glyph:通过视觉-文本压缩扩展上下文窗口长度)
[02:23 ] 🔍 Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation(面向通用检索增强生成的混合模态检索研究)
[03:10 ] 🔗 When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling(何时集成:定位Token级位置实现稳定高效的大模型集成)
[04:09 ] 🎯 Annotation-Efficient Universal Honesty Alignment(注释高效型通用诚实对齐)
[04:49 ] 🖌 Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback(Uniworld-V2:借助扩散负感知微调与MLLM隐式反馈强化图像编辑)
[05:46 ] 👁 RL makes MLLMs see better than SFT(强化学习让多模态大模型看得比监督微调更清楚)
[06:33 ] 🚀 Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling(视觉自回归模型在推理时扩展上击败扩散模型)
[07:09 ] 🎨 ConsistEdit: Highly Consistent and Precise Training-free Visual Editing(ConsistEdit:面向MM-DiT的高一致免训练视觉编辑)
[07:56 ] 🔄 Deep Self-Evolving Reasoning(深度自演化推理)
[08:22 ] 🧠 Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI(超越流水线:模型原生智能体AI范式转移综述)
[09:07 ] 🔮 Chronos-2: From Univariate to Universal Forecasting(Chronos-2:从单变量到通用预测)
<figure>
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递